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Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.136(a). In no event, however, may a reply be timely 
after SIX (6) MONTHS from the mailing dale of this communication. 
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- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133£{;h GHIffJ OTdgtKXJ 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 

earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

I) 13 Responsive to communication(s) filed on 26 September 2003 . 
2a)M This action is FINAL. 2b)D This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) [Xl Claim(s) 22 and 23 is/are pending in the application. 
4a) Of the above claim(s) _ 

5) D Claim(s) is/are allowed. 

6) [x] Claim(s) 22 and 23 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) D Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10)D The drawing(s) filed on is/are: a)Q accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 
Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 

I I) D The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-1 52. 

Priority under 35 U.S.C. § 119 
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1 .□ Certified copies of the priority documents have been received. 

2. D Certified copies of the priority documents have been received in Application No. . 
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application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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RESPONSE TO AMENDMENT 

The response filed September 26, 2003 has been entered into the record. Claims 
22 and 23 are pending and under examination, all other claims having been canceled. 

The text of Title 35 of the U.S. Code not reiterated herein can be found in the 
previous office action. 

Rejections Maintained 
The preliminary amendment filed 9-21-00 stands objected to under 35 U.S.C. 132 
because it introduces new matter into the disclosure. 35 U.S.C. 132 states that no 
amendment shall introduce new matter into the disclosure of the invention. The added 
material which is not supported by the original disclosure is as follows: " EcoRI digest of 

purified for 2 months." which is inserted at page 5, line 18 after "1989". The objection 

is maintained for reasons made of record in Paper No. 8, mailed 7-1-02 and the last office 
action of record. 

Applicant's arguments have again been considered but are again not persuasive. 
Applicant argues Dr. Crothers declaration that opines that the conditions specifically 
disclosed in the Southern et al article would be considered "stringent". This is not 
persuasive again, the incorporation by reference must be specific. Declarant is basically 
attesting that any condition of hybridization is stringent. This is not persuasive, Southern 
et al does not define these conditions as "Stringent". Further, Southern et al is a 
guidebook to selection of general hybridization conditions and does not define any specific 
conditions as stringent. The specification does not direct one to any specific conditions 
set forth in Southern et al and erroneously indicates that Southern et al defines stringent 
hybridization conditions, it does not. The incorporation of the specific material inserted 
into the specification is not a result of a specific incorporation by reference by pointing in 
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particular to the relied upon teachings in Southern et al, but instead reflects an improper 
general incorporation by reference to any condition recited therein. Nowhere in Southern 
et al is the word "Stringent" used. Therefore, nothing in Southern et al teaches 
"stringent conditions" as specifically referenced by the specification. As such, the skilled 
artisan would not be pointed to any particular hybridization conditions recited by Southern 
et al, none are characterized by Southern et al as "stringent" as opposed to not stringent. 
As previously indicated, Southern et al is merely a guidebook for selection of conditions 
that allow for hybridization between two nucleic acids and these conditions are dependent 
on the particular structure (G+C) content of the nucleic acid under study. As such, there 
is nothing articulated in the specification to any particular set of conditions in Southern et 
al. The declaration of Dr. Crothers is not persuasive on this point, it is the Specification 
that must particularly point to those conditions which are considered incorporated by 
reference. This point is made completely clear in the office actions of record. Applicant 
argues that Dr. Crothers' conclusion with regard to experimentation conditions is based 
upon his examination of the experimental conditions explored and used in the Southern et 
al publication. This again is not persuasive, Dr. Crothers' opinion is not supported by 
extrinsic evidence for the recited nucleic acid and moreover supports the assertion of the 
examiner that the incorporation by reference is not specific to any subject material in 
Southern et al. This argument appears to support that the reference is an improper 
general incorporation and does not in fact specifically reference the incorporated material 
as required by the MPEP and Hawkins decisions as previously set forth in the all the office 
actions of record. Unlike In re Voss , the incorporation is not reasonably precise, it was 
not made for clarity and appears erroneous on its face. It specifically references the 
Southern et al "as defined by", but the reference did not define Such conditions, but 
recites multiple conditions. This application is unlike the situation in In re Voss, because in 
Voss, Applicant pointed to a the relied upon material "discussion of glass-ceramic material 
and their production". The incorporation by reference in this specification does not. The 
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entirety of Southern et al is drawn to selection of hybridization conditions and the 
variation of those to achieve specific hybridization. There is no discussion of stringency 
and no particular conditions referenced that are particularly defined as such. It is the 
specification that must clearly and unambiguously point to the incorporating material. 
Unlike the relied upon paragraph in In re Voss, that does such, this specification does not 
for all the previous reasons made of record. Applicant argues that stringent conditions 
are not empirical in nature and are in fact a combination of temperature and salt 
conditions. Applicant argues that the concept of stringency (i.e. the intrinsic specificity 
of the hybridization reaction depends on the annealing conditions employed was familiar to 
those of ordinary skill in the field. While this may be true, the concept of "stringency" 
does not define any particular conditions, it is relative. Kenneljet al "Principles and 
properties of nucleic acid hybridization", Progr. Nucl. Acid Res. Mol. Biol. 11:259-301, 1971) 
teaches that ". It should be emphasized that the extent to which mismatched hybrids can 
form is relative and dependent upon the "stringency" of the reaction conditions. The 
stringency refers to the extent to which the reaction conditions allow only completely 
complementary structures to form. Generally, stringency is proportional to temperature 
and inversely proportional to salt conditions." (page 293, first full paragraph). Thus, 
stringent conditions are not a particular set of defined conditions because stringency is a 
relative concept and "stringent conditions" can vary depending upon the temperature and 
salt conditions. The skilled artisan would not pick one set of conditions over any other set 
in view of the teachings of Southern et al because the specification does not teach nor 
does it contemplate the degree of mismatch contemplated. As such, this specification 
does not point to any particular conditions that it defines or Southern et al defines as 
"stringent". In contrast to Applicants assertions, the Southern et al reference uses 
multiple conditions of temperature in salt (see figures) and as such does not define any 
particular conditions. There are no one Set of conditions that are "defined as stringent 
conditions". Even if one were to consider all the conditions set forth in Southern et al. as 
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"stringent", then Applicant's incorporation by reference falls to the level of a improper 
general incorporation by reference. Applicant argues that as evidence that "stringent 
hybridization conditions" are not empirical in nature and need not be independently 
determined for every different nucleic acid, Applicants point to the Written description 
guidelines Example 9. This is not persuasive, it is relying upon hypothetical teachings. All 
of the Examples are hypothetical in nature. Declarant and Kennel's asserted concept of 
Stringency emphasizes the relative nature of "stringency" and that no particular conditions 
are art defined as stringent, nor does Southern et al define such. As to the nature of 
empirical nature of hybridization, the property of G+C content on Tm etc are all 
exemplified by Kennel as cited Supra. The Structure of the nucleic acids completely 
reflects its ability to form a complex with a target under specifically defined conditions. 
However, this specification does not reference with any particularity any specifically 
defined conditions. It relies upon an vague incorporation by reference utilizing language 
not present in the reference and the skilled artisan is left guess as to the meaning 
thereof and the particular conditions referenced. Declarant's opinion is not persuasive on 
this issue because it essentially states that all hybridization conditions are stringent, 
while the art defines stringency as a relative concept and not defined by any particular art 
defined conditions. 

The rejection is maintained for reasons made of record herein and all the previous 
reasons made of record. 

Claims 22 and 23 are rejected under the judicially created doctrine of 
obviousness-type double patenting as being unpatentable over claims 22, 26 and 28 of 
copending Application No. 08/719,821, now US Patent No. 6,706,872. Although the 
conflicting claims are not identical, they are not patentably distinct from each other 
because the species of full complements claimed in the copending application, would by 
definition hybridize to 5EQ ID NO:l and thus anticipate the instant genus claims . The 
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rejection is maintained for reasons made of record in Paper No. 8, mailed 7-1-02. 
Applicants indicate that a terminal disclaimer would be filed upon indication of allowable 
subject matter. The rejection is hereby maintained until resolution by terminal disclaimer. 

Claim 23 stands rejected under 35 U.5.C. 112, first paragraph, as containing subject 
matter which was not described in the specification in such a way as to reasonably convey 
to one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. The rejection is maintained for reasons made of 
record in Paper No. 8, mailed 7-1-02. 

Applicants arguments have been carefully considered but are not persuasive. 
Applicants argue that the general incorporation by reference of Southern et al that 
recites numerous hybridization conditions and in view of the Declaration of Dr. Crothers is 
not persuasive. Applicant's arguments with respect to the insertion of new matter into 
the specification were not persuasive for reasons set forth above. 

Claim 22 stands rejected under 35 U.S.C. 112, second paragraph, as being indefinite 
for failing to particularly point out and distinctly claim the subject matter which applicant 
regards as the invention. The rejection is maintained for reasons made of record in Paper 
No. 8, mailed 7-1-02. 

Applicants argue that this is not well taken and should be withdrawn. This is not 
persuasive for reasons made of record previously and in view of the concept of stringency 
is relative and that Southern et al does not define any condition that is "stringent". This 
is not persuasive for reasons made of record previously and Applicant's arguments with 
respect to the insertion of new matter into the specification were not persuasive for 
reasons set forth above. 
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Claims 22 and 23 stand rejected under 35 U.S.C. 102(b) as being anticipated by 
Sigma Molecular biology Product Guide, 1991, pages 54-56). The rejection is maintained 
for reasons made of record in Paper No. 8, mailed 7-1-02. 

Applicants arguments have been carefully considered but are not persuasive. 
Applicants argue that there is no mention of stringent hybridization conditions in Sigma at 
all and therefore it can not anticipate the claimed invention. This is not persuasive, the 
claims are drawn to products by function. Applicants also argue that the rejection is 
incompatible with the 112, first paragraph rejection of record. This is not persuasive 
because as Applicants well know, the rejections are made under different statutes that 
have different specific requirements. The products of the art inherently have the 
recited function of hybridization based upon their 100% identity across the recited 
nucleotides. Applicants have not provided any extrinsic evidence that the products of the 
art would not hybridize under any Specific set of allegedly "stringent" conditions. 

Since the Office does not have the facilities for examining and comparing 
applicant's product with the product of the prior art, the burden is on applicant to show a 
novel or unobvious difference between the claimed product and the product of the prior 
art (i.e., that the nucleic acid of the prior art does not possess the same functional 
characteristics of the claimed nucleic acid). See In re Best, 562 F.2d 1252, 195 USPQ 
430 (CCPA 1977) and In re Fitzgerald et al., 205 USPQ 594. 

Claims 22 and 23 are rejected under 35 U.S.C. 112, first paragraph, as containing 
subject matter which was not described in the specification in such a way as to reasonably 
convey to one skilled in the relevant art that the inventors, at the time the application was 
filed, had possession of the claimed invention. The following written description rejection 
is set forth herein. 
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Applicants arguments have been carefully considered but are not persuasive. 
Applicants argue a Declaration by the inventor Dr. Barnwell. Dr. Barnwell attests that 
SEQ ID NO: 2 is in fact the full-length open reading frame and attempts to explain the 
differences between the apparent molecular weights of the protein as compared with the 
calculated molecular weight. This is not persuasive, the specification specifically teaches 
that it is a fragment. This specification the teachings of which are sworn and attested to 
by Dr. Barnwell. Further, Dr. Barnwell is attesting that they have disclosed the full-length 
open reading frame of 1011 amino acids and that the sequence of depicted in SEQ ID NO: 2 
is the complete open reading frame. This is not persuasive on its face. SEQ ID NO: 2 is 
1018 amino acids in length and is described as a partial sequence. Applicant is arguing that 
the full length is 1011 amino acids long. This argument is entirely incompatible with the 
written description of the disclosure. Applicants are attempting to redefine the invention 
as something that is shorter than that described by SEQ ID NO: 2 which is 1018 amino 
acids long. It is the specification that must teach conception of what is the invention. 
The specification teaches that the genomic fragment is an incomplete reading frame. 
Declarant, discovering his error in characterization well after filing this application can 
not seek to "correct" the specific written description thereof of the specification as filed, 
by assertion that the nucleic acid of SEQ ID NO:l is the full-length open reading frame, 
when the specification clearly indicates otherwise. The declaration of Dr. Barnwell, in 
essence an attempt to redefine the described invention of the specification after the 
filing date. The specification does not state that the complete gene codes for 1011 amino 
acids with a calculated molecular weight of 112,725 Da. There is no recognition of this 
fact in the specification as filed. Further, one skilled in the art would take the 
characterization of the gene as incomplete in view of the lack of a recited conventional 
start codon on the protein, the discrepancy of the molecular weights. Declarant provides 
many different possible explanations, but no extrinsic evidence that the particular system 
used provides for a protein having these characteristics. Further, the inability of a 
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indicating that a misrepresentation in a patent specification will only defeat the 
enablement of an invention if the skilled artisan fails to recognize the mistake and relies 
on it to practice the invention. This is not persuasive, the question of Wahl involves a 
best mode inquiry and whether the inventor failed to disclose particular manufacturing 
procedures beyond the information sufficient for enablement and that such failure gives 
rise to an inference that he concealed information which one of the ordinary skill in the 
art would not know. The issue at hand is conception of the full-length open reading frame 
by way of written description and not concealment of best mode or enablement. Further, 
the missing descriptive information is not a routine manufacturing choice. For written 
description of nucleic acids,the nucleic acid itself is required. See Fiers v. Revel , 25 
USPQ2d 1601, 1606 (CAFC 1993) and Amqen Inc. V. Chugai Pharmacentical Co. Ltd.. 18 
USPQ2d 1016. This written description inquiry is independent from enablement Vas-Cath 
Inc. v. Mahurkar , 19 USPQ2d 1111, makes clear that "applicant must convey with 
reasonable clarity to those skilled in the art that, as of the filing date sought, he or she 
was in possession of the invention. The invention is, for purposes of the 'written 
description' inquiry, whatever is now claimed." (See page 1117.) The specification conveys 
that the nucleic acid of SEQ ID NO:l encoding the protein of SEQ ID NO: 2 is not the 
full-length open reading frame. The specification does not clearly allow persons of ordinary 
skill in the art to recognize that the inventors invented the full length open reading frame 
as argued, specifically teaches to the contrary and one skilled in the art would not have 
any specific framework of reference to believe otherwise, since other similar proteins and 
nucleic acids have not been disclosed in the art. Therefore, unlike Example 8, there is no 
frame of reference for either completeness of the Sequence or function of the protein 
encoded by the nucleic acid, and the teachings of the specification would necessarily be 
taken as true. Applicants argue that because there is a single species and reduction to 
practice of a single species is representative of the genus because one skilled in the art 
would not expect substantial variation in the sequence. This is not persuasive, as 
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bacterium to splice any introns, would apparently provide for a protein of increased rather 
than decreased molecular weight because the intervening sequence (introns) would be 
expressed by the bacterium. Declarant provides no evidence that the this is the case in 
the described expressed by the method set forth in specification. With respect to in 
situ proteolysis of the expressed protein, this is hypothetical and does not provide 
extrinsic evidence that the protein produced by the method described in the specification 
undergoes in situ proteolysis. There is not a single post-translational modification 
described in the specification with respect to this protein. Therefore, the discrepancy of 
the molecular weights because of the inability of the bacterium to perform certain post- 
translational modifications is hypothetical. No posttranslational modifications of the 
protein encoded by the nucleic acid are taught nor contemplated by this specification. 
Declarant also attests that discrepancies are common among Plasmodium proteins because 
they contain repeated proline rich amino acid motifs that render them less susceptible to 
denaturation by SDS. This is not persuasive, there is no evidence to support this problem 
associated with SEQ ID NO:2. Applicants submit that the instant Specification enables 
the complete open reading frame. Enablement is not the issue here. Written description 
of the full length open reading frame and whether or not, Applicants at the time of filing, 
as described by the specification at the time of filing, conveyed possession of the full- 
length open reading frame. The examiner contends that the written description of the 
specification did not, convey possession of the full-length open reading frame and as such 
is not analogous to Example 8. Further, in contrast to Example 8, no asserted function is 
associated with the protein fragment of SEQ ID NO:2 and coding for a surface protein of 
P. vivax\s not a function of the protein, it is not a description of what it does, but where it 
is located. Location is not equivalent to function of a protein. Applicants were not in 
possession of the genus comprising SEQ ID NO:l because they do not describe the full- 
length open reading frame, nor do they describe the function of the protein. Applicants 
argues Wahl Instrument v Acvious Inc. 950 F2.d 1575 21 USPQ 2d 1123 (Fed Cir. 1991) 
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previously set forth, the claims encompass variants that do not encode proteins. The 
specification does not disclose that the disclosed species of SEQ ID NO:l was used in any 
manner to hybridize to other nucleic acids. Unlike Example 9 of the written description 
guidelines, there is no disclosed example where SEQ ID NO:l is used for the isolation of 
hybridizing nucleic acids. Additionally, the methods of Example 9 used defined conditions 
and recite a specific functional activity and were demonstrated to have the asserted 
activity. No such information is provided in this specification, no specific function is 
disclosed, SEQ ID NO:l was not used to isolate hybridizing sequences and as such the 
Specie of SEQ ID NO:l is not representative of a genus of hybridizing sequences with 
identical biological function. Therefore, the skilled artisan given the written description 
of the specification would clearly recognize that Applicants had no conception by way of 
written description of the genus of nucleotides hybridizing under stringent conditions, nor 
were they in possession, of the full-length open reading frame. 



Status of Claims 
Claims 22 and 23 stand rejected. 

Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
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extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the 
advisory action. In no event, however, will the statutory period for reply expire later than 
SIX MONTHS from the mailing date of this final action. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published 
applications may be obtained from either Private PAIR or Public PAIR. Status information 
for unpublished applications is available through Private PAIR only. For more information 
about the PAIR system, see http://pair-direct.uspto.gov. Should you have questions on 
access to the Private PAIR system, contact the Electronic Business Center (EBC) at 866- 
217-9197 (toll-free). 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Patricia A. Duffy whose telephone number is 571-272- 
0855. The examiner can normally be reached on M-F 6:30 pm - 3:00 pm. If attempts to 
reach the examiner by telephone are unsuccessful, the examiner's supervisor, Lynette 
Smith can be reached on 571-272-0864. 

The fax phone number for the organization where this application or proceeding is 
assigned is 703-872-9306. 




Primary Examiner 



Art Unit 1645 



Copied from 09880748 on 11/01/2004 




fTO5BAUb(0M») 



Substitute for torn 1448M&PTO 

INFORMATION DISCLOSURE 
STATEMENT BY APPLICANT 

(lit* a* irony s/iect* «* mctamiy) 


Complete If Known 


Appfcaton Number 


0W667,130-Conf.#8596 


Fifing Oats 


September 21, 2000 




John W. Barnwell 


ArtUnft 


1645 




P. Duffy 


Sheet) 2 | of | 2 


Attorney Oocfcel Number 


05986/10076B6-US5 



JUL 09 2m 





CW" 


Panton et at., Mol. and Blochem. Paraattni. 35:149. 1989. 








or 


Rock et at.. Parasitol. 95L209. 1987. 








cr 


Thomas. A.W. et at. Parasite Immunol. 12:105. 1980. 








C2" 


Vernicketal.. Nucleic Acids Res. 16(14):6883. 1988. 








CAT* 


Wellems et at.. Proc. Natl. Acad. Scl USA 83:6065-6069. 1 986. 








CB1** 


Wilson at al., Parasitol. 71:183. 1975. 








CC1** 


Wilson et al.. The Lancet. July 26, 1969. oo. 201-204. 








CD1 M 


Wilson et al., Nature 284:451. 1980. 








CE1" 


Wilson et al.. int J. Parasitol. 3:511. 1973. 








CF1" 


Yang Y-F et al., Mol. and Biochem. Parasitol. 26:61. 1987. 





Copied from 09880748 on 11/01/2004 



Notice of References Cite* 



Application/Control No. 
09/880,748 


Applicant(s)/Patent Under 
Reexamination 
RUBEN ET AL 


Patricia A. Duffy 


Art Unit 
1645 


Page 1 of 1 



U.S. PATENT DOCUMENTS 



* 




Document Number 

Country Code-Number-Kind Code 


^WFYYYY 


Name 


Classification 




A 


US- 










B 


US- 






-^.rY-^^r^ 




C 


US- 










D 


us- 










E 


us- 










F 


us- 










G 


us- 










H 


us- 












us- 










J 


us- 










K 


us- 










L 


us- 










M 


us- 









FOREIGN PATENT DOCUMENTS 







Document Number 
Country Code-Number-Kind Code 


Date 
MM-YYYY 


Country 




Classification 




N 














0 














P 














Q 














R 














S 














T 













NON-PATENT DOCUMENTS 



Include as applicable: Author, Title Date, Publisher, Edition or Volume, Pertinent Pages) 



v 



w 



X 



*A copy of this reference is not being furnished with this Office action. (See MPEP § 707.05(a).) 
Dates in MM-YYYY format are publication dates. Classifications may be US or foreign. 

PTO-892 (Rev. 01 -2001) Notice of References Cited Part of Paper No. 042004 



Copied from 09880748 on 11/01/2004 



Principles and Practices of Nucleic 
Acid Hybridization 

David E. Kennell 

Department of Microbiology, 
Washington University School 

of Medicine 
St. Louis, Missouri 

I. Introduction • 259 

II. Yield of Hybrid as a Function of Reactant Concentration . 262 

A. The Law of Mass Action 264 

B. The Reaction Is Reversible 267 

C. Estimation of Equilibrium Constant for RNA-DNA Reaction 271 

III. Kinetics of the Hybridization Reaction 275 

A. Second-Order Kinetics and Estimation of Genome Hetero- 
geneity 275 

B. First-Order Kinetics and the Reaction of RNA with DNA 276 

IV. Estimating Relative Transcribing Activities of DNA Sites . 281 

A. A Simple Case: Fraction of Genome for rRNA . 281 

B. A Complex Case: Total Cell RNA 282 

V. Quantitative Limitations of Hybridization .... 283 

A. Kinetic Considerations . . 284 

B. Thermodynamic Considerations . . - . • • 284 
VI. Use of Competitor RNA to Estimate Specificity ... 286 

A. Some Basic Considerations ....... 287 

B. A Not So Simple Case: Estimating the Fraction of Pulse- 
Labeled RNA That Is rRNA 287 

C. A Very Complex Case: Estimating Relatedness of RNA 
Populations from Different Cells 288 

VII. Hybridization of Nucleic Acids from Eucaryotic Cells . 292 

A. Factors Adding to the Complexity of Interpretations . 292 

B. Stringency of Reaction Conditions and Thermal Stabilities 

of Complex 293 

C. Expected Reaction Kinetics for Hybridization of Mouse 
RNA to DNA 294 

D. The Reaction Is Limited for Thermodynamic and Kinetic 
Reasons 296 

E. Presaturation Competition 297 

VIII. Conclusions 29 ^ 

References 299 



I. Introduction 

The formation of stable complexes of RNA with denatured strands of 
complementary DNA (1, 2) provides a biochemical assay that relates 



Copied from 09880748 on 11/01/2004 



DAVID E. KENNELL 



DNA to the immediate product o£ gene expression. In the ten years since 
the initial observation, RNA -DNA hybrid formation has been an im- 
portant part of many thousands of research papers. Probably no other 
assay has found such widespread use in molecular biology. 

Since the chemistry of RNA • DNA hybrid formation is similar to that 
of DNA -DNA renaturation, most of its basic aspects are common to the 
DNA -DNA reaction. The chemistry of denaturation and reassociation of 
DNA are discussed in the extensive reviews by Marmur, Rownd, and 
Schildkraut (3) and Felsenfeld and Miles (4). Most chemists still believe 
that the major forces holding together a double-stranded nucleic acid 
are derived from the cooperative "stacking-free energy" (3,5). Hydrogen 
bonds, while probably important for the specificity of the reaction, are 
thought to play a minor part in the total stability of the duplex in solution. 
In any case, many physical and chemical agents cause strand separation 
or denaturation of DNA (3). The two most commonly used arc heat and 
high pH. However, many organic molecules are also effective. Formamidc, 
in particular, has been useful (6-8) because at high concentrations it 
causes denaturation at room temperature without intrastrand scissions 
(6,7). Although the rate of breakage of phosphodiester or glycosyl bonds 
in RNA is relatively slow at the usual annealing temperatures (9), such 
breakage has been observed with ribosomal RNA (7, JO). Tims, form- 
amide is useful in studies in which recovery of the intact hybrid RNA is 
desired or when very long annealing times are required. 

Many factors affect the reassociation of denatured DNA. In general, 
the optimal temperature of reassociation (TV) is related to the melting 
temperature (T m ) and is about 25° lower (3, 11). The rate of renatura- 
tion is dependent on several factors (3) including salt concentration, 
G -f- C content, and the viscosity of the solvent (12, 13). Wetmur and 
Davidson (14) showed that the rate was proportional to the square root 
of the strand length; longer strands react more rapidly than shorter ones. 
Furthermore, the rate of renaturation obeys second-order reaction kinetics 
(3, 14); this means that the rate of reaction is a function of the reactant 
concentrations. It follows that strands from a complex genome will re- 
nature more slowly than those from a less complex genome, given that 
all other parameters, such as size, total concentration, G -f C content, 
etc. are the same (15). 

The direct relationship between the product of concentration of com- 
plementary strands to the rate of reassociation suggested that the initial 
rate-limiting step in reassociation is a nucleation event caused by the 
collision and formation of a few correct base pairs (3, 14). The subse 
quent "zippering" reaction is presumably fast and results in the ultimate 
formation of stable complex. However, there is some minimal length 
below which the duplex will not be stable. Various experiments have 
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been performed with synthetic polynucleotides (16-19, 19a) or with 
natural nucleic acids that had been sheared to defined sizes (20, 21) in 
order to estimate this minimal length. These studies have been reviewed 
by Walker (22) and Thomas (23), and it would appear that, depending 
on G -f- C content (11, 21a, 24), the minimum size for a stable complex is 
from 10 to 20 nucleotides. The thermal stability rises sharply for longer 
lengths so that, depending on the G -f C content (21a), the stability of 
a complementary duplex of 25-50 nucleotides approaches that of any 
much longer complex (15, 18, 20, 21a). 

The thermal stability of a nucleic acid duplex is extremely sensitive to 
the presence of mismatched nucleotide pairs within the polymer strands 
(25-27, 27a). The reduced stabilities of many associated complexes from 
cells of higher organisms results from variable extents of mismatching 
within duplexes in the hybrid population (15, 20, 22,27). There seems to 
be a need for more quantitative studies with defined synthetic polymers, 
to determine the effects on thermal stabilities of defined types of mis- 
matching within the complex. For example, it is not clear whether mis- 
matching normally occurs in clusters of contiguous bases or whether the 
noncomplementary bases are scattered at random throughout the 
molecule. 

Many studies have been possible only because of technical improve- 
ments in the procedures of hybridization. These have been reviewed by 
Gillespie (28). The use of agar as a supporting medium for the denatured 
DNA (29) was widely used until introduction of nitrocellulose filters for 
trapping the DNA (30). The filter method has many advantages with 
respect to simplicity, reproducibility, and relatively low nonspecific back- 
ground sticking. However, there are certain hybridization studies, such 
as those involving high RNA/ DNA inputs, for which it may be advan- 
tageous to react in solution before trapping the hybrid on a filter (31). 

Single-stranded DNA and RNA can be separated from duplex struc- 
tures by chromatography on calcium phosphate (hydroxylapatite) col- 
umns (21a, 32-34). This supporting medium is useful for many kinds of 
studies since it is stable over a wide range of temperatures and to organic 
solvents. Furthermore, it provides an instrument for analysis or purifica- 
tion of preparative amounts of nucleic acids (15, 35, 35a). It is also useful 
for studying oligonucleotide-polynucleotide complexes (19a). 

Progress in another area of biology involving hybridization has been 
of great significance for study of gene expression. It has become possible 
to study RNA synthesized from one or a few genes by use of DNA carry- 
ing only a small fraction of the genome. So far, this last development 
lias been of great value for an analysis of the expression of die genes of 
phage A. (36) and the E. coli genes responsible for tryptophan synthesis 
(37, 38). Limited segments of the bacterial chromosome can be carried 
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either on episomal F factors (39) or as a part of a transducing bacterio- 
phage. Transducing phages are much hetter suited than episomal carriers 
for purification of preparative amounts of DNA that can be enriched 
100-fold or more for selected genes of the bacterial chromosome. As 
might be predicted there is a high degree of background homology be- 
tween either an episomal factor (39) or a generalized transducing phage 
(40) and total E. coli UNA. Thus, specialized transducing phages have 
been the most useful. Signer and Beckwith transposed the lac operon 
close enough to the chromosomal attachment site of A. or att 080 to allow 
their transduction (41) and procedures have been outlined to construct 
such strains with any selectable marker (41, 42). 

For any assay to be useful it must be quantitative. In the next sections 
some features of the hybridization reaction that bear on quantitative inter- 
pretations are discussed. Technical considerations such as optimal salt, 
temperature, volume, etc., have been reviewed (28, 30,31). Rather than 
dwell on these, it might be useful to discuss the factors that determine 
the yield in any complex reaction that is the summation of several hun- 
dred reactions. Such is the case in the reaction of RNA from a cell to its 
DNA; however, many of the following comments could apply to any 
multireaction system. 



II. Yield of Hybrid as a Function of Reactant Concentration 

The complementarity of an RNA to a given DNA is monitored in the 
hybrid reaction by the percentage of the reacted RNA that forms a stable 
hybrid. Ideally, at a given RNA/ DNA input, all RNA molecules for 
which excess complementary DNA sequences are available should be 
bound at equilibrium. Unfortunately, a fraction of the RNA for which 
complementary DNA sites are available may not be bound. This is shown 
most easily by observing the fraction of RNA bound when the concentra- 
tion of RNA and DNA are varied but their ratio is kept constant. In this 
situation, if complementarity were the only variable, the fraction bound 
should remain constant. This is clearly not the case (31) (Fig. 1). 

The concentration curves show that the equilibrium constant K, for 
the reaction: RNA + DNA ^± RNA • DNA is such that at sufficiently low 
concentrations of reactants, a significant fraction of the pulse-labeled RNA 
is not bound at equilibrium, even though sites are available. However, if 
the label is only in the stable RNA molecules that themselves represent 
about 97% of the total RNA of E. coli, then the yield is not concentration- 
dependent at similar RNA/ DNA inputs (31, 43). 

These differences between RNA labeled only in stable components and 
RNA that is pulse-labeled reflect the very different distributions of isotope 
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Fig, 1, Hybridization efficiency as a function of nucleic acid concentration. Pulse- 
labeled RNA in 0.5 ml of 0.9 M NaCl + 0.09 M Na citrate was allowed to react 
for 20 hours at 66°C with denatured DNA. The /xg RNA/Vg DNA inputs, indicated 
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(fraction of RNA hybridized) versus R for a given R/D will give a con- 
centration-dependence curve. Such families of curves for different RfD 
inputs are most easily generated by using a computer to solve the quad- 
ratic equation for different values of R, D and EL One such family is 
shown in Fig. 2 for the arbitrarily chosen value of K — 10~*. 

Each of the four curves represents binding of one of four species of 
RNA (Rj through R,) and each species is ten times more concentrated 
than a preceding species; this gives a 1000-fold range of concentrations. 
In all cases, the slopes approach a value of 1 at very low inputs of DNA 
and RNA; i.e., the fractional yield becomes proportional to the amount 
of DNA reacted. This follows from rearrangement of Eq. ( 1) to give 



D 



R 



KR ' 



1. 4_ * 

K KR 



(3) 



For R» y «D, y/R -» D/K. The fraction bound is proportional to the 
amount of DNA reacted when the fraction of RNA hybridized is a small 
proportion of the maximum that could be hybridized at that input ratio. 

As noted above, even that species in excess with respect to its DNA 
site (RNA/ DNA input = 10/1 curve) is not maximally bound at equilib- 
rium under conditions of sufficiently low input concentrations. However, 
this species is always more fully bound (y/R is closer to its maximum 
value) than are the other species, except at very high inputs of RNA and 
DNA, when the binding of all species is maximal. For example, at D = 
10" 3 , virtually all of R L for which sites are available ( 10%) is bound, while 
R 3 and R 4 are about 90% and R 2 about 73% bound. At D = ICt*, the 




Fig. 2. Expected 
of RNA concentration 
of RNA to its DNA ; 



RNA input 

curves for die percentage of input RNA hybridized a 
at the indicated RNA/DNA ratio. Each curve is. for 
ite when the equilibrium constant, K, is 10 J . 



a function 
ne species 



Copied from 09880748 on 11/01/2004 



DAVID K. KKNNEkL 



binding of R t is 90S of maximum while R, and R 4 are 49% and R 2 is only 
381 At D = 10-=, the same values are 493!, 9%, and 8.5%, respectively; Iii 
terms of the total RNA hybrid (2y/R), it can be seen thit in this range 
of R/D inputs its variation with total RNA concentration (R, -|- R 2 -f 
Ra + R*) is mainly a function of the R, species (R/D input =1). When 
all species are fully bound, R 3 and R 4 contribute only 10* and 1%, respec- 
tively, as much hybrid as does either R t or R 2 . Thus, in general, if the 
hybnd binding is submaximal, it is so mainly due to the incomplete bind- 
ing of the RNA molecules that are just filling their sites at the given RNA/ 
DNA input, i.e., for which the RNA/DNA input is in the range of 1. 

One conclusion from this kind of analysis is that as the RNA/DNA in- 
put is increased and more and more DNA sites are filled, the slopes of 
the concentration-dependence curves should become less steep. The most 
simple way to visualize this is to consider the sequential filling of the 
DNA at the RNA/DNA inputs at which (a), the most abundant mENA's 
are just filling their sites and thus contributing a large fraction of the 
hybridized label, compared to (b) the least abundant mRNAs are just 
filling their sites and thus contributing only a small fraction of the labeled 
hybrid. In the latter case, most of the DNA sites that can be filled would 
be saturated at lower RNA/DNA inputs. Therefore, the percentage con- 
tribution to the total hybrid by the more rare RNA species is not as great 
when they fill their sites as was that of the more abundant molecules at 
the RNA/DNA inputs when they just filled their sites. Since the major 
contributors to a concentration dependence are those molecules that are 
just filling their sites, the dependence at high RNA/DNA input, when 
the rare species are being bound, is not as obvious experimentally. Thus, 
the slopes of the curves are less steep; this relationship is seen in Fig. L 
The major contributors to a concentration dependence on yield are 
those species that are just filling their DNA sites, i.e., they are in the same 
range of concentrations as are their DNA sites. Thus, in Fig. 2 at D = 
10- 3 the fraction bound (y/R) is about 75% of the maximum possible at 
that R/D input with at least 95% of the unbound RNA, for which DNA 
sites are available, being species R 2 (R 2 /D 2 = 1). Suppose the total RNA 
input (Rx + R 2 + R 3 + R 4 ) per unit DNA were increased 10-fold. This 
would now mean that each RNA species represented by a curve in Fig. 
2 would be represented by the adjacent curve on the right, e.g., that RNA 
species that had been 1/10 input would now be 1/1 input and this species 
would now provide the major contribution to a concentration dependence 
on yield. Again the amount of total DNA required to give a maximal 
amount of hybrid would be about 10- 2 , i.e., the fractional yield would be 
proportioned to the same amount of DNA. Thus, in a heterogeneous 
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mixture of RNA molecules in which there is a fairly continuous distri- 
bution of numbers per species, the amount of DNA required to give a 
maximum yield of hybrid is the same at all RNA/DNA inputs. In re- 
examining the concentration-dependence curves for pulse-labeled E. coli 
and T4 RNA, it can be seen that this rule is correct within the limitations 
of these measurements; about 150 /tg of DNA is required for maximum 
efficiency in the E. coli system and only 10 jig in the T4 hybridizations 
(Fig. 1). There are obvious exceptions to this rule, e.g., if one is measur- 
ing radioactivity and the specific activities of very rare species far exceed 
those of the other molecules. 

Also, it should be emphasized that the amount of DNA required for 
maximal efficiency will depend on the heterogeneity of the DNA and the 
reaction conditions. For the reaction of E. coli RNA to its DNA and T4 
RNA to its DNA, in the usual ranges of DNA reacted (1 /ig to 100 jig), 
we observed that the hybrid yields were less dependent on concentrations 
at a given RNA/DNA input when both RNA and DNA were completely 
free in solution than when DNA was trapped in a filter (31). This sug- 
gests that the association constant for the reaction is higher when both 
species are in solution. It is probable that when the reaction occurs in a 
filter, the DNA is constrained to migrate within the cavities of the filter. I 
have assumed in this and the following section (III) that the reaction 
can still be treated by the same thermodynamic and kinetic considerations 
as for the reaction in solution, although presumably the activity coeffi- 
cients of the nucleic acids must be different in the two cases. 

Concentration dependence on yield would be less serious if reaction 
conditions were modified so as to increase the association constant. How- 
ever, it is not obvious that this could be done to a significant extent 
without also decreasing the specificity of the reaction. For example, 
complex formation is favored at lower temperatures. However, while the 
yield of complex can be increased by reaction at low temperatures (46), 
e.g., 40°, the specificity as well as the thermal stabilities of the resulting 
duplexes is greatly reduced (20, 45-47). 

It should be emphasized that the concentration dependency for final 
fractional yield of hybrid has no direct relationship to the kinetics of the 
reaction. As is discussed in Section III, the reaction of bacterial or bacte- 
riophage nucleic acid can be complete at 16 hours. The curves of Fig. 
1 show yields at equilibrium. 

B. The Reaction Is Reversible 

Another way of describing a concentration dependency for yield is 
simply to state that the reaction can be reversed. If true, there should 
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Time (hours) Temperature CO 

Fig. 3. Loss of pulse-labeled Escherichia coli RNA from preformed E. coli RNA- 
DNA hybrid (RNA/DNA input of %) as a function of DNA concentration, heft: 
Incubation of filter in 0.5 ml 0.9 M NaCl + 0.09 M Na citrate at 66°C (O) 100 fig 
DNA, (A) 1 ng DNA. Right; Incubation in 0.5 ml 0.15 M NaCl + 0.015 M Na 
citrate for 15 minutes at the indicated temperature. After the final incubations the 
filters were removed and washed, treated with RNase, and again washed before 
counting (31). (#), 100 pig DNA; (O), 1 pB DNA. 

Gillespie and Spiegelman (30) observed complete retention of DNA 
to the filters after 20 hours of hybridization at 66°. We have not observed 
losses either, but a few reports have claimed small but measurable losses 
of DNA in 24 hours at 66° (3-8&) (48). One group claimed 28 and 41% 
losses of pea DNA (7). It would be difficult to prove by measuring loss 
of labeled DNA that the differences between 1 /ig and 100 fig DNA filters 
are not due to loss of the entire DNA -RNA since a very small fraction 
(<5%) of the DNA is complexed; perhaps only the unbound DNA is 
retained. 

However, another experiment was designed to show that the DNA 
remains competent during the 20 hours of reincubation. If the reaction 
were reversible, it should be possible to show exchange when [ 3 H]RNA- 
DNA hybrids are reincubated with [ 14 C]RNA. Furthermore, according 
to Eq. (1), the extent of exchange (percentage) should be greater for 
lesser amounts of hybrid. 
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[ 'H]RNA from 1 jug filters is from DNA sites still competent to form new 
hybrid. 

These results support the conclusion that the reaction of RNA with 
DNA obeys the law of mass action, and as a result, the fractional yield 
of hybrid as well as the loss of preformed hybrid are concentration- 
dependent. 

C. Estimation of Equilibrium Constant for RNA-DNA Reaction 

For the simple two-component system considered in Fig. 2, the 
fraction of RNA bound [y/R at any given concentration of RNA (R) and 
DNA (D)] determines a unique value of K. However, because of the 
relative crudeness of the measurements, K is measured most accurately 
at reactant concentrations sufficient to hybridize a significant fraction of 
RNA but low enough so that the maximal fraction has not been hybrid- 
ized, e.g., so that 50% of the RNA is bound rather than 1% or 99%. 

The E. coli and T4 curves in Fig. 1 show a concentration dependence 
for the hybridization of whole cell RNA to its DNA; as such, they give 
the total hybrid from the reaction of several hundred species of RNA, 
each containing a unique number of molecules. As indicated in the 
preceding discussion, almost all of the concentration dependency is due 
to the RNA molecules that are just filling their DNA sites. It would be 
difficult to estimate a value of K for the reaction since the more abundant 
RNA species would provide an unknown background of bound label. 
However, it should be possible to make a rough estimate of K for the 
reaction of RNA from the lactose operon (lac RNA) to lac DNA (Fig. 1). 
Of necessity, it would be very approximate for a number of obvious 
reasons, including the fact that any bacterial mRNA species is composed 
of molecules that vary from a very small size (perhaps a trinucleotide) 
to a maximum corresponding to the length of the operon (50). However, 
it may be of some value to make these calculations without taking the 
exact answers too seriously. 

Lac RNA is synthesized as a polycistronic mRNA from the three 
genes Z, Y, and A (51). The length of the Z gene is 3700 nucleotides, 
(52, 53), the Y gene is probably about 900 nucleotides in length, judging 
from recent estimates of the permease size (54), while the A gene is also 
about 900, judging from the size of /?-galactosidase transacetylase (55). 
The O gene is very small ( <100 nucleotide pairs) (56). This gives a total 
length of about 5500 nucleotides in a double strand of DNA of M.W. 
(330) (2) (5500) = 3.6X 10°. 

Lac mRNA was detected by annealing it to lac DNA carried in the 
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of about 10" 10 M, which is very close to that derived here from concen- 
tration dependence curves. 

Bishop recently estimated a dissociation constant from the reaction of 
rRNA to DNA (48). The estimated values were K = 0.114 ml" 1 for a 
24-hour reaction or 0.047 /ng ml 1 after extrapolating to infinite time. These 
would give values for K of 0.76 X 10" 10 M and 0.32 X 10" 10 M, respec- 
tively, which are somewhat lower than the other two estimations. In con- 
trast, for SPOl phage RNA-DNA hybrids, Gage (62) estimated a K value 
of 5 X 10" 10 M. This estimate was based on the assumption that the 50% 
value for RNA hybridized at certain concentrations of DNA excess re- 
flected thermodynamic aspects of the reaction only. Since this is probably 
close to the maximum value achievable, it is undoubtedly a very rough 
approximation. However, tentatively, these crude estimations point to a 
dissociation constant for completely complementary RNA-DNA hybrid 
in the neighborhood of 10~ 10 M. 

The dissociation constant, K, can be used to calculate a free energy 
change, AG, for the reaction at 66°C, 

AG = -2.3 RT log K = -2.3 X 2.0 

X'339°JfIog(1.2 X 10- 10 ) = +15,600 cal. 

Thus, there are about 16 kcal/mole energy driving the reaction toward 
association. It is possible to make a rough estimate of the equivalent num- 
ber of base-pairs formed to account for this energy. Two kinds of base 
pairings probably contribute most of the total free energy of duplex for- 
mation (5, 65) (see also references given in 65). The first includes the 
initial base pairings, which is given by 

AG° = — RT In a 

where o- is the equilibrium constant for the "nucleation" event. It has 
been estimated to be approximately 10~ 4 (65). The second bond is that 
involved in pairing of bases adjacent to one or more paired bases; its 
energy is influenced strongly by the stacking energy of the contiguous 
bases and can be given by 

AG 0 = -RT\ns» 

where n is the number of such paired bases. Thus, the total free energy 
is 

AG°total = —RT In (as") 

and K = <rs n . 

The value of s at the reaction temperature (66°) can be estimated 
from the relation: 
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Hi. Kinetics of the Hybridization Reaction 

A. Second-Order Kinetics and Estimation of Genome 
Heterogeneity 

In a simple bimolecular reaction 

A + B^AB 

the rate of reaction is 

dx/dt = h(a - x) (b - x) - hx (6) 

where a and b are the initial concentrations of reactants and x is the 
amount of complex at time (t). When a = b, 

dx/dt = h(a - xf - hx (7) 

If k x X> ki, the reaction proceeds far to the right and it is described as 
"irreversible." If this approximation is acceptable, dxj dt — k t (a — x) 2 , 
and integration and rearrangement gives the fraction bound as a function 
of time 



(8) 



The time to complex one-half of a is 



'. Each filter w 
s at the temper; 



k * = kTa (9) 

The time to complex half the reactants is inversely proportional to the 
initial concentration. This relationship can be used to estimate genome 
heterogeneity when denatured DNA strands of approximate gene size are 
reassociated. 

Let us assume that the DNA from one organism contains 1000 genes, 
each of which has a unique sequence, while the DNA of a second orga- 
nism contains 10,000 unique sequences. Sequences in the first DNA will 
be ten times more concentrated than are the sequences in the more hetero- 
geneous DNA. Since each segment reanneals independently of the others, 
the fraction of total DNA annealed in a given time should reflect this 
heterogeneity. Britten and Kohne (15) plotted the fraction of reasso- 
ciated DNA (x/a) vs a-t for denatured DNA fragments from a variety 
of sources. The expected relationship was found when DNA from bacteria 
or phage was reannealed; the t 1/2 was proportional to the number of 
nucleotide pairs over several orders of magnitude. In contrast, DNA 
from eucaryotic cells contained a fraction that reannealed at c 
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tionately low values of a-t. This suggested that such DNA contains many 
repeafang sequences. For example, 40% of calf DNA was estimated to 
contam sequences that may be repeated 100,000 times while the remain- 
ing 60% of the DNA may contain only unique sequences (15). 

These analyses rely on two assumptions regarding the nature of the 
reaction. First, the reaction is assumed to be second order (3, 12-14, 67) 
so that t 1/2 is inversely proportional to a in Eq. (9). Second, the reaction 
is assumed to be irreversible, i.e., k 2 is assumed to be zero in Eq. (6). 
While in these particular studies this assumption can be made without 
invalidating the broad conclusions, it should be recognized that it is not 
true that the only physical consideration is the collision rate (IS), so that 
even very rare molecules would be annealed if one could only wait long 
enough. Of course, this consideration is crucial for the interpretation of 
many other experiments (Section VII). 

If the reaction were not "irreversible," how would the unique se- 
quences reanneal? In these studies the unique sequences were annealed 
after several days because several milligrams of DNA were used (15). 
Calf DNA has about 700 times more nucleotide pairs per cell than does 
E. coli DNA. At the concentration of 8600 /xg calf DNA/ml used in these 
studies, the unique sequences were as concentrated as are the sequences 
in E. coli DNA at 17 /xg/ml. This concentration should be adequate to 
give 100% binding at equilibrium. At the other extreme, the repetitive 
sequences form duplex structures with a reduced thermal stability (15, 
27, 46, 47). However, even at only 2 M g calf DNA per milliliter, these 
sequences are at an enormous relative concentration (equivalent to 
400 ng of E. coli DNA/ml if there are 100,000 copies of each sequence). 
Thus, the reaction is driven to the formation of complex. However, a 
significant fraction of the isolated complex dissociates very quickly (sim- 
ply by washing) at the same temperature at which it was formed at the 
very high concentrations (15, 27). This demonstrates the obvious rever- 
sibility of the annealing reaction for these species. Of course, the reduced 
stability of these complexes implies that only a fraction (perhaps half) 
of the base pairs are complementary on the repetitious DNA segments 
that have the size used in these studies (15). 

B. First-Order Kinetics and the Reaction of RNA with DNA 

The kinetics of RNA DNA hybridizations are more complex for the 
following reason. Cellular RNA is not only a heterogeneous mixture of 
hundreds of species, but each contains a unique number of molecules. 
In E. coli there are at least 1000 times more molecules of the most abun- 
dant mRNA than of the most rare with a continuous distribution between 
these limits. The rRNA's are 50 to 100 times more frequent than the most 
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common mRNA species (43). As a result, at any given RNA/DNA input, 
almost all the RNA species either are in much higher or in much lower 
concentration than are their complementary DNA sites. Because of this, 
these species should anneal with approximate first-order rather than 
second-order kinetics. The reaction is "pseudo" first order because the 
concentration of one component remains essentially constant. Of course, 
those species whose concentration is close to that of their complementary 
DNA sites will react with second-order kinetics. The net reaction kinetics 
will be a function of the amount of radioactivity in the two classes of 
RNA. 

In a first-order reaction the time to complete the reaction is independ- 
ent of initial concentration of the limiting component. When b » a in 
Eq. (6), 

J = hb(a - x) - hx (10) 
Integration and rearrangement gives 

h b + k, = j In j^- *— (11) 

with Xe the amount of a bound at equilibrium. When x — )iXe, 

t - 0693 (12) 
hn - hb + k 2 . (12) 

The time required for binding half (or any fixed fraction) of the amount 
of a that will ultimately be bound is a constant. The reaction will be about 
97% complete after five such half-lives regardless of the initial concen- 
tration of a, the limiting component. 

This relationship is very important for an evaluation of the reaction 
of RNA with DNA. Unfortunately, no one seems consciously to have 
tested this prediction directly. Many papers show a proportionality 
between concentration of both reactants and initial reaction rate and con- 
clude that the reaction is second order. However, the reaction may pro- 
ceed more rapidly as the concentration of the limiting component is in- 
creased because the final yield is increased proportionately, i.e., one is 
simply titrating the excess DNA sites without changing the r 1/2 of the 
reaction. This could explain the early observation of Nygaard and Hall 
(68) that the reaction was proportional to both RNA and DNA concen- 
tration. The curves (Fig. 2 of ref. 68) showing rates as a function of DNA 
concentration are all initial rates of reaction with excess RNA (except 
25 /xg/m\ DNA) since infected cell RNA starts saturating T2 DNA sites 
at RNA/DNA input of about one (unpublished observations). Thus, the 
relationship with DNA concentration probably has nothing to do with 



Copied from 09880748 on 11/01/2004 



DAVID E. KENNELL 



"""" JUiNNELL 

the reaction kinetics as such i> K m • j 
However, in contrast, C'eS^SmT ^^^^ns. 
studied using a considerable « " ZT*™™™™™ -« rate were 
reflect changes primarily in ^? 'j^f mA hhel *«* P">bably did 
ency was observed again" $ Shop ( j^ ^ ^ 

actions with variable anuMnil^^Sgfr ^ { ^ed re- 
« 8-fold or greater excess of sIuLa ^ ' * ^ W£ * 
<* of DNA compI exed) was foZ^t ^T^u F* 1 ° f hybrid 
were quite proportional to RNA cZIZhL T ^ ^ « ^ 
hted to DNA concentration or to KS£TV SSI ^ DOt at «« re " 
.trahons. The same relation^ ,ZV™7? 01 .™ A and DNA 

concen- 

hybridization in ranges of > fiTlV fo L? e toitJal ra ^ of rRNA 
Unfortunately, I havf not fold t v caTf^ * rDNA Sites <*>• 

above, this would be difficult to 7b™ * ? m S" 9 * As indicated 

cell DNA. It would re^lL^J^^ total 
give excess DNA for all species- if Z ZtSI mpUtS m order *> 

m rate would be influenced bv'uLn T?"S WCTe not met ' cb ^ge* 
KNA bound at e^ibri™ DNA^^t * 

bactenal operon would be suitable fo^tb 3 . !, ^ a S ? ecific 

excess DNA case is a DNA -tZa " Study ' 11,6 rarest to an 
McConaughy reacted ^^^ll^J^ ^ 
«** and limiting amount of DNT^^i^J DNA a con " 
proportional to the concentration „?fil7 T ( f l ^ WtiaI was 
that are not obvious, onlv Z 1 " "T™™ DNA - *»* for reasons 
free DNA. y P about a 12 " f oW excess of filter-bound to 

-^^r^^^^ *. other chemical 

verified by standard kinetic naLS 7 't^' ^ Cou]d be 

acids ,s estimated from , he f„lwL V "J"* 0 " ° f £ coS 
about HI of , he pnl!e ., ™, ™ °™S tactions W . At ,. 

«° < one-tenth ft. co„«» * Tf »» »»™A species that 



DAVID E. KENNELL 

juired to complete the reactions. 

concentration on rate were 
)rt RNA label and probably did 
±e reaction; the RNA depend- 

he Idnetic studies of stable RNA 
Spiegelman (30) followed re- 
id DNA; in all cases, there was 
.. Of course, the yield of hybrid 
Q all cases but the initial rates 
ration; they were not at all re- 
duct of RNA and DNA concen- 
d for the initial rates of rRNA 
3ss rRNA to rDNA sites (48). 
s showing a clear rate-depend- 
: is in great excess. As indicated 
ising any cell RNA with total 
/ RNA/DNA inputs in order to 
mdition were not met, changes 
f the DNA, i.e., the fraction of 
3. DNA enriched for a specific 
tch a study. The nearest to an 
tion on filters. McCarthy and 
>f filter-bound DNA with a con- 
ition (27). The initial rate was 
.--bound DNA, but for reasons 
2-fold excess of filter-bound to 

reaction, like other chemical 
dependent, follows first-order 
iderable excess. This could be 
neans that those RNA species 
n excess all reach equilibrium 
uent they are. Of course, they 
lection II), but if so, it would 

an RNA/DNA ratio of about 
he reaction of E. coli nucleic 
;ervations (43). At this input 
brid of which 5% is in stable 
ie that all mRNA species that 
'ir DNA sites will react with 
•J A's that start saturating DNA 



NUCLEIC ACQ) HYBRIDIZATION 



279 



sites at an RNA/DNA input of one and all less-frequent species— about 
50$ of all mRNA with 50% of the mRNA pulse-label (C m /R m in Bolton- 
McCarthy analysis) (44). Assume for simplicity that k x » k 2 in Eq. (10) 
so that all RNA is bound at equilibrium; thus for any species, dx/dt = 
^(a — x) where & = DNA concentration, a = initial RNA concentra- 
tion and x = amount of hybrid. Integration and rearrangement give 



l -^H^-J as) 



(14) 

Thus, those species that are not fully bound at equilibrium would take 
somewhat longer to reach equilibrium since k 2 in Eq. (12) is opposite in 
sign to k r . However, this factor should be of marginal significance for any 
species contributing measurable amounts of label to the hybrid. The re- 
action given by Eq. ( 13 ) would approach equilibrium as shown by the 
dashed curve of Fig. 6. 

Another class of reactions would include those species that react with 
second-order kinetics, i.e., the RNA and DNA concentrations are within, 
say, a factor of 10 of each other. Assuming again that k t » k 2 , rearrange- 
ment of Eq. (8) gives 

'4al^) (15) 
A plot of the reaction with time is given by the solid curve of Fig. 6. 
Note that it takes much longer to reach equilibrium for reactants at the 
same concentration than it does when the concentration of one of the 
reactants is greatly reduced. The time required to anneal half of the RNA 
is comparable, but it takes twice as long for 75% of the RNA to react and 
significantly more time to reach equilibrium. 

However, the rate of reaction rapidly approaches first-order kinetics 
as the concentrations of reactants become different. This is shown by the 
dot-dash curve of Fig. 6, which gives the kinetics for RNA species that 
are one-half the concentration of their DNA sites, i.e., in this example, 
they saturate at RNA/DNA inputs of 1/5 (the most abundant mRNA 
species of E. coli) (43). The equation generating that curve is derived 
from Eq. (6) (again with k 2 — 0) 

■t-rA-Vi^^A d6) 
h(a - b) a(b — x) ' 

It can be seen that 98% of the RNA is bound (dot-dash curve) in less 

than twice the time required to bind 98% in the first-order reaction 

(dashed curve). 
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Fig. 6. The fraction of RNA bound as a function of time in first- and second- 
order reactions. In all cases the concentration of the DNA site is the same (D„). In 
the first-order reaction, tire concentration of RNA (H) is <D„. In the second-order 
reactions R = 0.5 D„ or R = D„. 

A third class of reactions includes those RNA species whose concen- 
trations far exceed the concentrations of their homologous DNA sites. 
At an RNA/ DNA of 1/10, these obviously would include only the stable 
RNA's. These would react with the first-order kinetics of Eq. (13) except 
now b = RNA input and a = DNA input. Since b in this class is greater 
than b in the first class, these should be the fastest reacting species in the 
total RNA population. 

At an RNA/DNA input of 1/10, the slowest reacting sequences would 
be those for which RNA = 0.5 (DNA). Even these would reach 90% 
binding almost as fast as the more rare species that react with first-order 
kinetics. Of course, the fraction of label that reacts with first versus 
second-order kinetics will vary with RNA/DNA input. There will be cer- 
tain critical ratios at which a significant fraction of label will react with 
second-order kinetics; one such ratio would be RNA/DNA = }l at which 
the most abundant mRNA's just equal the concentration of their DNA 
sites (43). However, at almost all RNA/DNA inputs, the bulk of the 
radioactivity would participate in reactions in which one component is in 
sufficient excess to give approximate first-order kinetics. 
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If the above assumptions are correct — and they have yet to be tested 
rigorously but evidence so far is fairly supportive — it means that, in 
general, hybridization reactions should reach equilibrium much sooner 
than do DNA reannealing reactions for which the reactants ( -f- and — 
strands) are exactly equal in concentration. Furthermore, unlike a second- 
order reaction, the time to reach equilibrium is independent of the con- 
centration of limiting component; it is not true that the very rare species 
would become bound if we could just wait long enough. They reach 
equilibrium as soon as more concentrated species but may not be fully 
bound because of thermodynamic, not kinetic, considerations. 

Finally, the rate of the reaction is dependent on concentration of the 
excess component. At double the concentration the reaction will reach 
equilibrium about twice as fast (Eq. 13). Of course, the same applies to 
both reactants in a second-order reaction. This provides a means for de- 
termining if the reaction is essentially complete in a given time. It is clear 
that the annealing reaction of bacterial, as well as certain classes of 
eucaryotic cell RNA, can be complete by 20 hours provided the concen- 
trations of reactants are reasonably high (30, 31, 49, 69-75). This was 
verified over a wide range of RNA/DNA inputs even with nucleic acid 
concentrations so low that most RNA -DNA pairs were not associated at 
equilibrium (31). 



IV. Estimating Relative Transcribing Activities of DNA Sites 

Most investigators use hybridization to obtain a simple "yes-or-no" 
answer to some question. Such questions usually ask whether some cell 
or tissue is making any RNA complementary to a given DNA. Often the 
investigator can ignore any chemical and technical aspects of the reac- 
tion and obtain the right answer. However, as soon as the biological ques- 
tion includes a quantitative answer, e.g., 57% of the RNA has some 
characteristic, the chemistry of the reaction becomes very important to 
the inteipretation. 

A. A Simple Case.- Fraction of Genome for rRNA 

Probably the most simple case is to estimate the fraction of DNA 
complementary to rRNA. The rRNA can be purified virtually free of other 
species, then hybridized at increasing RNA/DNA inputs to determine the 
input at which all complementary sites are just filled. From this value and 
the fraction of RNA hybridized, it is a simple calculation to derive the 
fraction of DNA complementary to rRNA (24, 43, 71, 73, 74, 76, 77). 
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tion of mRNA mass in this species. This condition is only fulfilled in 
steady-state labeling, i.e., long-labeled RNA for which all molecules have 
the same specific activity. It is easily attained when working with micro- 
organisms but is difficult to achieve in many eucaryotic cell studies. 

The titration of DNA by pulse-labeled RNA can be of interest if it 
is compared to the titration by long-labeled RNA. If the titration curves 
are the same, i.e., a given fraction of DNA is complementary to the same 
fractions of pulse-label and RNA mass, it suggests that on the average 
the number of molecules per species is determined by rate of synthesis 
rather than stability. This was concluded with respect to mRNA of E. coli 
(43). We may note that this conclusion is supported by recent observa- 
tions that the rate of chemical degradation of one E. coli mRNA (lac 
mRNA) is identical to the average rate of degradation of the total mRNA 
(50). 

An alternative way to analyze the data from a titration is to use the 
equations of a Langmuir adsorption isotherm for analysis of the filling 
of a surface. In this case, the surface is DNA, and it is being filled by 
RNA. The equilibrium constant 



K = 



(R - S)(oD - S) 



where R = RNA input, D = DNA input, S = RNA bound, and a = the 
fraction of DNA competent to accept RNA. Rearrangement gives 

A plot of D/S against 1/R — S has a slope of K/a and an intercept of 
II a. The data from the titration of the E. coli chromosome (43) were 
replotted this way, and as expected, the slope changed as a function of 
RNA/DNA input. This follows from the fact that the reaction is the sum- 
mation of hundreds of sites being filled, and as a result, a is really not a 
constant. However, the intercept of 11 a, extrapolated to correspond to 
infinite RNA input, gave the same value for a as had been derived by 
the McCarthy-Bolton analysis: 10% of the DNA. This value of 10% was 
interpreted to mean that all the mRNA detected over the range of abun- 
dances measured (about 2000-fold) was transcribed from 20% of the 
potential gene sites (43). 



V. Quantitative Limitations of Hybridization 

Recently, Bishop et al. challenged the use of hybridization for quan- 
titative studies (48, 69, 70, 78). Basically, two considerations seem to be 
under question. The first is a kinetic one: the reaction is not complete in 
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RNA/DNA inputs at which DNA sites are just being filled, sites that are 
partially filled with incomplete lengths of mRNA may be further filled 
by another small mRNA that is nonoverlapping with the first. The net 
result of such a terminal filling would be to make the transition from 
excess to limiting DNA less sharp. 

A second and more important consideration applies to such studies 
with any defined RNA. As discussed in Section II, at any given RNA/ 
DNA input the yield of hybrid at equihbrium is a function of concentra- 
tion of reactants. Thus, at sufficiently low concentrations of reactants, not 
all the RNA molecules are bound for which complementary DNA sites 
are available (Fig. 1). In the gal mRNA -DNA work (70), it is not pos- 
sible to evaluate the effect of concentration since the concentrations are 
not given. However, in the saturation of tyrosine tRNA sites, which is 
considered a very clear case (70, 78), the reactant concentrations are 
given (80, 81). In the Landy et al. study (80), 2 jug of <£80dSu3 -f DNA 
is saturated by 0.8 /xg of RNA input. At this ratio, 0.0016 /xg of tRNA is 
bound to the same amount of DNA (0.08%). These are extremely low 
concentrations of reactants. The concentrations in the other study are 
higher, but still very low (81). We observed obvious concentration de- 
pendence on yield with 0.06 jug of rRNA reacting to 0.56 jug of rDNA 
(43) and with about 0.001 /xg of lac RNA to 0.2 ^g of lac DNA (Fig. 1). 
In fact, the only obvious interpretation of a curvilinear approach to satura- 
tion is that there is a concentration dependence for yield. It means that 
more RNA has to be added to fill DNA sites that are available. As em- 
phasized earlier ( 31 and Section II ) , the best way to prove that this de- 
pendency exists is to choose one RNA/DNA input and measure fractional 
yield as a function of input concentrations. In contrast, by adding more 
RNA per DNA one may also be observing the actual titration of sites as 
well as a changing "efficiency" of reaction. 

In fact, some saturation curves have shown remarkably sharp transi- 
tions in slope when going from unsaturated to saturated DNA sites, e.g., 
the saturation of rRNA sites (71, 83 and unpublished observations) and 
even the saturation of trp mRNA sites (if the curve of Fig. 5, reference 
Si, is redrawn to fit the points better). 

A further argument for the inadequacy of hybridization for quan- 
titative estimations is based on a complete titration of the E. coli chromo- 
some. Instead of concluding that only 20% of the potential gene sites make 
significant levels of RNA (43), the author arrived at a value of 28%, and 
furthermore felt that even this figure might be low (70). A very important 
technical consideration throws doubt on these results. With 2 mg/ml 
RNA input, 0.12 /xg of RNA was hybridized per microgram of DNA. 
Unfortunately, the actual amounts of RNA and DNA reacting are not 
given, but if it were 1 xxg of DNA, this would give 0.006% of the RNA 
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peting for a common DNA. Competition by nonidentical RNA molecules 
will be discussed in the section on hybridization of eucaryotic cell nucleic 
acids. 

A. Some Basic Considerations 

Certain basic principles should be considered in such studies. ,* 

1. RNA molecules compete on a mass basis while usually the measure- 
ments are of radioactivity bound. Thus, if the labeled RNA molecules 
have different specific activities, the amount of label excluded by the 
competitor may bear little relation of the amount of RNA excluded. 

2. Only labeled RNA molecules that are in excess of their DNA sites 
(without competitor) will have their binding decreased to an extent pro- 
portional to the amount of the competitor RNA molecules added. 

B. A Not So Simple Case: Estimating the Fraction of Pulse- 
Labeled RNA That Is rRNA 

Probably the simplest example of competition is directed to show what 
fraction of pulse-labeled RNA is ribosomal (rRNA). Ideally, one would 
want three different RNA samples: (a) RNA with label only in rRNA; 
(b ) purified unlabeled rRNA; and (c) the pulse-labeled RNA ( [ 3 H]RNA) 
being studied. The labeled rRNA should be titrated to excess DNA to 
determine the RNA/DNA input at which the sites for rRNA are just filled. 
For E. coli this ratio is about 1/160 to 1/200 (43, 49). Thus, at RNA/DNA 
inputs < 1/200, DNA sites will be in excess and proportional competition 
will not be observed. However, it would not be wise to use higher 
[ 3 H]RNA/DNA inputs, since at higher RNA/DNA input the fraction of 
hybrid that is mRNA is greater. For example, at RNA/DNA = 1/10, only 
4% of E. coli pulse-labeled «H in the final hybrid is rRNA, while at 1/200 
it is 50% (43); in the former case only 4% of the label could be excluded 
by excess rRNA. 

How much competitor RNA should be added? Offhand, one might 
want to add as much as possible. This would be satisfactory if one could 
be sure that the purified rRNA competitor is not contaminated by other 
RNA species. For example, if the competitor RNA/ [ S H] RNA were 1000 
and 0.2% of the competitor were mRNA, then 2 units of pHjmRNA would 
be present per 200 units of DNA when the [ 3 H]RNA/DNA = 1/200. This 
would be sufficient to saturate E. coli sites for a significant fraction of the 
most abundant mRNA. The result would suggest that rRNA is a much 
larger fraction of pulse-labeled RNA than it actually is. For this reason, 
it is usually safer to choose a more conservative ratio of [ 1 H] to [ 3 H]RNA 
(say 50). Because of this, it is best to avoid an extremely low [ 3 H]RNA/ 
DNA input (e.g., < 1/1000) since the reduction in 3 H bound would not 
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be proportional to competitor RNA added and an additional calctilation 
would be necessary. Thus, in this case the [ 3 H ] RNA/ DNA would best 
be close to, but not greater than, 1/200. 

C. A Very Complex Case: Estimating Relatedness of RNA 
Populations from Different Cells 

The most frequently used competitor experiments involve the com- 
petition of one total cell RNA sample against another. In this case the net 
competition is the summation of the simultaneous competitions of hun- 
dreds or thousands of species. This procedure has been used to study 
transcriptional patterns as a function of infection lime by virulent phages 
(85-87) and similarities between RNA from different cucaryotic cells or 
tissues (e.g., 75, 88-92). The same general rules apply to this more com- 
plex situation, i.e., the [ 3 H]RNA/DNA input as well as the [ 3 H]RNA/ 
[ 1 H]RNA inputs must be chosen to provide answers to a specific question. 

There are several hundred species of mRNA in a cell, and probably 
the numbers of molecules per species also vary over a several hundred- 
fold range. Therefore, in practice, since the competitor competes against 
the 3 H that would otherwise form hybrid, its effectiveness as a competitor 
depends only on its ability to compete against those [ 3 H]RNA species 
that form the bulk of the hybrid. The particular set of species forming 
the bulk of hybrid will be different at each different [ 3 H]RNA/DNA 
input and include more species as the [ 3 H]RNA/DNA ratio is raised. 

In the limiting case at very high [ a H]RNA/DNA input, all DNA sites 
will be filled, and all RNA species competed against. In theory this would 
be the condition from which to make generalizations about the relatedness 
of the active DNA sequences. Each DNA site would be filled with one 
[ 3 H]RNA molecule (no competitor) irrespective of the frequency dis- 
tribution of RNA molecules per species. In practice this measurement is 
difficult for technical reasons; enormous amounts of RNA are required 
and the fraction of 3 H hybridized becomes intolerably low. 

Some competition experiments have tended to the opposite extreme, 
i.e., very low [ 3 H]RNA/DNA inputs (say <1/10) to estimate "similar- 
ities" between total RNA from cells or tissues. A few comments should 
be made. First, ribosomal and transfer RNA comprise the bulk of the 
RNA in all cells, while the total mass of mRNA is a minor fraction (about 
1% to 8%). Thus, the E. coli case (mRNA = 3% of RNA), should serve as 
a satisfactory example for most cells. At a [ 3 H]RNA/DNA ratio of 1/25, 
approximately 17% of the total E. coli RNA mass can be bound to DNA 
sites. Of this final hybrid, 82% would be stable RNA and only 18% would 
be mRNA (Fig. 7). Since RNA molecules compete on a mass basis, a 
competing cell RNA could eliminate over 80% of the RNA mass from the 
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hybrid without having a single mRNA species in common. Of course, the 
percent of 3 H eliminated would also be a function of the distribution of 
3 H in the stable and mRNA fractions. If 50% of the 3 H were in mRNA 
then elimination of stable RNA from the hybrid would only exclude 14!? 
of the S H. However, in many eucaryotic cells mRNA may represent a 
much lower fraction of synthesized RNA (93). If it were 10% rather than 
50%, competing out only stable RNA could exclude 46% of the 3 H. 

Second, again using E. coli as an example, regardless of the 3 H dis- 
tribution, there would have to be at least a 10-fold increase in the mass 
of the most abundant mRNA's before even they would saturate their DNA 
sites (Fig. 7). Such molecules might be present in the competing RNA, 
but if they were present at only one-tenth their concentration in the 
[ 3 H]RNA, a 100-fold excess of total RNA ( 1 H/ 3 H = 100) would be neces- 
sary in order to saturate these sites. Clearly, such a starting [ 3 H]RNA/ 
DNA input might give very misleading results regarding similarities be- 
tween molecules in the mRNA fraction. 

Most competition experiments start with some intermediate 
[ 3 H]RNA/DNA input and attempt to estimate relatedness of mRNA from 
a different cell or tissue. At an [ 3 H]RNA/DNA input of 1/4, 4% of E. coli 
RNA forms hybrid of which about 37% is stable RNA. Thus, even at this 
ratio, 37% of the hybridized RNA mass could be eliminated by competitor 
RNA lacking mRNA in common. However, if 50% of the 3 H were in 
mRNA, elimination of rRNA would only eliminate 0.75% of the 3 H from 
hybrid. In this case, successful competition would indicate similarities in 
the mRNA. However, the bulk of the mRNA is transcribed from a small 
fraction of the active DNA; in E. coli, 80% of the mRNA is transcribed 
by only 10% of the active genes (43). As a result, at an [ 3 H]RNA/DNA 
input of 1/4, 80% of the hybrid could be eliminated even though 90% of 
the species of mRNA were nonidentical. While such a result would sug- 
gest that 80% of the 3 H molecules have competitors in the competitor 
RNA, it says very little about similarities between the species or kinds 
of molecules in the two samples. This simple but important distinction 
is usually overlooked. 

Another consideration in this kind of experiment makes questionable 
the conclusions almost always derived from competitions of this kind. 
Invariably, the ratio of competing RNA to [ 3 H]RNA is increased to some 
extremely high value ( >100). If one follows the decrease in 3 H hybrid- 
ized as this ratio is increased, it is clear that it does not decrease pro- 
portionately. Two variables contribute to this. First, the DNA sites for the 
abundant species, which contribute the bulk of the 3 H hybrid, are not 
all filled to the same extent at the [ 3 H]RNA/DNA input, The other reason 
is that the 3 H counterparts of these abundant [ 3 H]RNA may be a minor 
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Fig. 8. The percentage of [ 3 H]RNA hybrid as a function of competitor [*H]RNA 
concentration (100% = value with no competitor). One microgram of [ S H]RNA is 
hybridized to DNA from which it was transcribed. The lower curve represents com- 
petition by the homologous unlabeled RNA. In the other three curves, 80% of the 
[ 3 H]RNA is in species present in the ['H1RNA. (5%) of the competitor [ ] H]RNA 
competes with 80% of the [ 3 H]RNA for sites that are all saturated with no [*H]RNA 
present. (15%) of the -competitor RNA competes for DNA sites for 80% of the 
[ 3 H]RNA. There are two abundance classes for these molecules, and the 'H molecules 
all have the same specific activities. The first class contains 70% of the 3 H, and the 
second 10%; 5% of the ['HJRNA is homologous to the first 3 H class and 10% to the 
second. The [ 3 H]RNA/DNA is such that the first class would just saturate its DNA 
sites. (30%) of the competitor RNA is homologous to 80% of the [ 3 H]RNA. There 
are two frequency classes as in the previous case, except that the 70% and 10% classes 
of 3 H are now 60% and 2055, respectively, and the 5% and 10% 'H classes are now 
10% and 20%, respectively. 



made are: (a) 50% of the RNA molecules being made are common to 
both cells. This is unjustified for reasons given above, (b) 50% of the 
species of RNA are identical. This interpretation is even further from 
reality than the first but invariably authors casually substitute species 
distributions for molecule distributions at some point. It is incorrect be- 
cause most of the label being excluded from hybrid is in a small fraction 
of the total species, (c) Finally, more cautious investigators add the 
phrase "on the average" to cover the possibility that the 50% of the S H 
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excluded could be in some group of RNA with widely different numbers 
per species both for the labeled RNA and unlabeled competitor. While 
this phrase probably does protect the conclusion because it is undefined 
(average of what?), it takes all significance out of the result. 

An analogy comes to mind. The average year round temperature in " 
St. Louis is listed as a comfortable 59°F. This fact is unimpeachable; it 
is <20°F most of four months, >90°F most of another four, and every- 
thing in between during the remainder. 



VII. Hybridization of Nucleic Acids from Eucaryotic Cells 

A. Factors Adding to the Complexity of Interpretations 

The importance of the hybridization reaction is most severely tested 
in the reactions of RNA and DNA from eucaryotic cells. In general the 
more complex an organism, the more DNA per cell (haploid set) of the 
organism (15). The haploid DNA content of a mammalian cell may be 
1000 times greater than that of E. coli. The work of the Carnegie group, 
especially, has shown that a fraction of the eucaryotic RNA -DNA com- 
plex formed during most hybridizations has a relatively low thermal 
stability (46, 47). It is this fraction that includes DNA sequences that 
reanneal much more rapidly than expected from kinetic considerations 
(15, 27) (Section III). Both these observations led to the proposal that a 
fraction of eucaryote DNA contains stretches of nucleotides that are re- 
peated hundreds or thousands of times and probably differ from each 
other to a relatively minor extent (15, 27, 45, 46, 94). Britten and David- 
son proposed a model that assigns a function to the repetitive sequences; 
the model takes the original Jacob-Monod operon system (95) and am- 
plifies it to include multiply interacting operons (96). Much of the RNA 
that contains repetitive sequences may include the rapidly turning-over 
RNA that is confined to the nucleus; this nuclear RNA seems to represent 
the bulk of RNA being synthesized in the cell (93, 97-101). Such RNA 
could play a regulatory role by binding to specific operator sites on the 
chromosomes (102, 103). 

The presence of repetitive sequences increases the complexity of the 
reactions of DNA with DNA or of DNA with RNA. However, it does not 
make these reactions of any less value for the study of gene expression in 
eucaryotic cells than they are for such studies with bacteria and phage. 
The same chemical reaction is involved, and the same general considera- 
tions regarding rates and yields should still apply. 
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B. Stringency of Reaction Conditions and Thermal 
Stabilities of Complex 

Partially mismatched hybrids are readily formed when most, but not 
all, of the nucleotides of RNA molecules are complementary to certain 
DNA strands. The T m may be reduced about 1° for every 1% increase 
in mismatching of the complementary strands (26, 104). However, it 
should be emphasized that the extent to which mismatched hybrids can 
form is relative and dependent upon the "stringency" of the reaction 
conditions. The stringency refers to the extent to which the reaction con- 
ditions allow only completely complementary structures to form. Gen- 
erally, stringency is proportional to temperature and inversely propor- 
tional to salt concentration. 

Denis (105) showed that amphibian hybrids are of variable stability, 
and Church and McCarthy (45) compared the reactions of viral, bac- 
terial, and mammalian nucleic acids. B. subtilis DNA appears to contain 
no repeating sequences (27); however, mismatched hybrids are formed 
when the reaction occurs at 50° rather than >60° (45). Partial mismatch- 
ing was detected by an examination of the thermal profile for elution of 
the RNA from the hybrid. In general, the less stable a hybrid, the lower 
is its r m for elution. It follows that the more variable the stability of total 
hybrid, the broader is the temperature range of elution. As the tempera- 
ture of hybridization is raised, hybrid stability and homogeneity with 
respect to stability both increase when mouse RNA and DNA react. At 
75° the melting curve was as steep as the steepest B. subtilis curve and 
had a T m of about 80° in 0.15 M NaCl + 0.015 M Na citrate (45). 

Analyses of thermal elution profiles are therefore of extreme impor- 
tance, especially in hybridization studies with eucaryotic cells. In fact, 
in most eucaryote cell hybridizations, they are absolutely essential in 
order to establish the significance of the hybrid yield. Clearly, 20% of the 
RNA hybridized with variable degrees of mismatching means something 
different than 5% of the RNA all hybridized with complete complemen- 
tarity. Both results could be obtained at the same RNA/ DNA input with 
different conditions of salt and temperature. Similarly, estimates of the 
extents of similarity of two RNA populations by competition experiments 
are greatly affected by reaction conditions (45). This follows directly 
from the preceding considerations: an RNA molecule that is similar, but 
not an exact copy of another RNA, will compete for a common DNA site 
if the stringency conditions are sufficiently relaxed. 

Those hybrids with lower association constants due to mismatching 
should elute from the DNA at lower temperatures. In theory, both the 
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and are derived from 0.1% of the DNA. There is no information on the 
stability of this fraction, but it probably includes most of the mRNA, and 
in mammals there is some suggestive evidence that mRNA's are quite 
stable (93). In any case, for purposes of this example, assume the specific 
activity of this fraction to be one-tenth that of the preceding repetitive 
fraction. This would mean that the unique fraction was 72 of the RNA 
mass. It could be somewhat more if the tRNA were derived from unique 
sequences, but it is assumed that rRNA (comprising about 80% of the 
bulk RNA) would be derived from the intermediate class containing the 
fairly repetitive sequences (24, 72-74). 

At an RNA/ DNA input of one, 7% of the RNA will be reacting with 
0.7% of the DNA (the unique sequences). Thus, the average con- 
centration of an RNA species will be 10 times that of its DNA site. How- 
ever, the concentrations of each species will vary over some wide range. 
Assume 1000 different species [about the number required to handle 
metabolism (106)] with a continuous distribution of numbers of mole- 
cules per species; the most abundant contain 100 times as many molecules 
as the most rare and account for 1% of the mass of this fraction. The most 
abundant RNA in this class would be 100 times more concentrated than 
the DNA sites to which it anneals. Thus, no more than 1% of this species 
could be bound and at this low concentration the final yield would un- 
doubtedly be much lower. For this species the reaction would be first 
order at a rate determined by RNA concentration rather than DNA con- 
centration Eq. (7). However, this RNA species would be about 1000 
times less concentrated than are the repetitive DNA sequences; if h were 
the same in the two cases, the reaction would be correspondingly slower 
but still the fastest of any species in this class. At the other extreme in the 
unique class, the concentration of the least abundant RNA would be 
about equal to that of its DNA site. In this case the reaction would be 
second order with a rate proportional to concentration of both reactants. 

The most important species to consider in the intermediate class are 
the rRNA molecules. Assume the 18 S and 28 S species each represent 
40% of the total RNA mass and each is homologous to 100 DNA sites or 
about 0.02% of the DNA [the mouse has 3 X 10 9 nucleotide pairs per 
genome (15); so if 28 S RNA has a molecular weight of 10", then 100 
genes for 28 S RNA would require 0.023% of the DNA]. Since the 28 S 
RNA represents 40% of the total RNA, it would be about 2000 times more 
concentrated than its DNA sites at RNA /DNA inputs of one. The 0.05% 
that could anneal would do so as fast or faster than even the RNA for 
the satellite DNA since the rate would only be proportional to rRNA 
concentration. While die rRNA label would be an insignificant fraction 
of the hybrid at this input ratio, it would become significant at very low 
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its DNA (43, 71). In the former case, the concentrations of the ribosomal 
gene sites are several hundred times lower than in the bacterial case. 

E. Presaturation Competition 

There can be extensive mismatching when nucleic acids from eu- 
caryotic cells are annealed. Most investigators wish to estimate degrees 
of similarity between RNA populations from different cells or tissues. 
This is usually done by competing one RNA against another for a single 
DNA. Naturally, the mismatching of both the unlabeled competitor and 
the labeled RNA will tend to complicate any interpretation of the 
results. One modification of the procedure that has been used entails 
hybridization of the unlabeled competitor RNA before the labeled RNA. 
This procedure has been termed "presaturation" competition (108, 109). 
Soeiro and Darnell (108) showed that less labeled RNA is excluded by 
presaturation than by "simultaneous" competition with an unlabeled 
competitor RNA. However, McCarthy et al. (45, 92) found no difference 
in the two procedures with respect to exclusion of labeled RNA. 
Presaturation competition is favored by Soeiro and Darnell because 
they believe that once formed, the hybrid complex is stable to dilution. 
In contrast, mismatched hybrids are bound imperfectly and reversibly. 

In theory, this modification should be of value to distinguish perfect 
from imperfect hybrids if the concentration of hybrid is such that the 
degree of reversibility in the absence of excess RNA is a function of the 
stability of the hybrid. In Section II, the reversibility of E. coli RNA- 
DNA hybrids was shown. Even though such hybrids are very stable as 
judged by high melting temperature, a large fraction of RNA could be- 
come dissociated from the DNA upon reincubation of the hybrid in the 
absence of free RNA. The fraction of RNA dissociated was a function 
of two related variables: the dissociation constant K and the concen- 
tration of hybrid (Fig. 3); i.e., it obeyed the law of mass action. There- 
fore, it follows that those [ 1 H]RNA hybrids for which the equilibrium 
constant is higher will tend to dissociate more upon reincubation and be 
replaced by [ 3 H]RNA that might or might not give more stable com- 
plexes. An [ 3 H]RNA complex might be less stable than its competitor but 
displace the [ J H] complex if the [ 3 H]RNA concentration were sufficiently 
high. 

If the ^HJRNA that competes with the [ 3 H]RNA for a common DNA 
site is present in great excess over the [ 3 H] species, it will tend to exclude 
3 H from the hybrid in a "simultaneous" hybridization even though it is 
more poorly matched. However, in a "pre-saturation" hybridization the 
same relatively unstable [ J H] hybrids would tend to exchange with the 
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